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Higher derivative scalar field theories have received considerable attention for the potentially ex- 
planations of the initial state of the universe or the current cosmic acceleration which they might 
offer. They have also attracted many interests in the phenomenological studies of infrared modifi- 
cations of gravity. These theories are mostly studied by the metric variational approach in which 
only the metric is the fundamental field to account for the gravitation. In this paper we study the 
higher derivative scalar fields with the metric-affine formalism where the connections are treated 
arbitrarily at the beginning. Because the higher derivative scalar fields couple to the connections 
directly in a covariant theory these two formalisms will lead to different results. These differences 
are suppressed by the powers of the Planck mass and are usually expected to have small effects. 
But in some cases they may cause non-negligible deviations. We show by a higher derivative dark 
energy model that the two formalisms lead to significantly different pictures of the future universe. 
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I. INTRODUCTION 

O ' 

The scalar fields have extensive applications in cosmology. They were used in building the early universe models, 
such as inflation [J-Q, bouncing cosmology [4j, cyclic universe [f|, pre-big-bang [f| and so on to generate the initial 
j-jjrj. conditions of the structure formation. They were also modeled as the mysterious dark energy to drive the current 
accelerating expansion of the universe Q. Among these applications, the higher derivative scalar field models attracted 
special interests. The non-degenerate higher derivative theories (in which the Lagrangian density depends on the higher 
derivative terms non-linearly) were proposed in Ref. [1[ to construct single field quintom dark energy models Q which 
I \ cross the cosmological constant boundary during evolution and violate the null energy condition. Such theories were 
also applied in the early universe to build inflation models II fj , a nd, due to the property of violating the null energy 
condition, to realize the non-singular bouncing cosmology [111. Il2j. Such non-degenerate higher derivative scalar field 
models are plagued by the presence of the ghost degrees of freedom which lead to the quantum instability even though 
ly-^ ' they can be classically stable. Another motivation to study the higher derivative scalar field theories is stimulated 
by the recently proposed Galileon models [l3j]. The Galileon models are viewed as local infrared modifications of 
Cn| 1 general relativity (GR), generalizing an effective field description of the DGP model [14j. Specifically, the Galileons 
are higher derivative scalar fields that nevertheless have second-order equations of motion and are invariant under the 
"galilean" symmetry, <fi(x) — > 4>{x) + c + b^x^ , where c is a constant and & M are the components of a constant vector. 
The second order equations of motion for the Galileons make the theory free from propagating extra ghostly degrees 
of freedom. Strictly speaking, this picture is only valid in the Minkowski spacetime. In the curved spacetime, not 
all the higher derivative terms in originally proposed Lagrangian density for the Galileon [l3j can guarantee a second 
order equation of motion. To eliminate the higher order derivatives in the equation of motion, some non-minimal 
couplings between the curvature and the scalar field have to be introduced [l5j . However, in this resulting theory that 
was in fact discovered by Horndeski about four decades ago [l6| , the galilean symmetry is broken. After relaxing the 
galilean symmetry and without considering the non-minimal couplings to the curvature, the higher derivative scalar 
fields which only have second order equations of motion must have degenerate Lagrangian densities, that means the 
Lagrangian densities depend on the higher derivative terms linearly [T7| . Such type of models present novel interesting 
phenomena. They correspond to imperfect fluids [Til . [l9| and more importantly may violate the null energy condition 
without any instability in the fluctuations. Due to this advantage, they are used to realize the quintom dark energy 
scenario [l7|, [13 and to address issues in the early universe through inflation [2(| and non-singular bounc e f2li . Some 
other phenomenological studies of the Galileon or generalized Galileon models can be found in Refs. j22rl29j |. 

Usually these higher derivative scalar field theories are studied through the metric approach. Like in standard GR, 
the spacetime is assumed to be pseudo-Riemannian and the metric is the only fundamental field to account for the 
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gravitation. The connections have a prior relationship with the metric tensor, T M po . = \g^ \d p g va + d G g pl , — d u g p „) 
which are called as the Levi-Civita connections or the Christoffel symbols in the literature. All the variables like the 
curvature and the covariant derivative operators are defined by these Levi-Civita connections and the metric tensor. 
To derive the gravitational field equations from the variational principle, we need only vary the given action with 
respect to the metric. 

As we know, there is an alternative variation approach in which the connections and the metric are taken as inde- 
pendent quantities. To get the field equations, one must vary the action with respect to both of them. This approach 
is called as the Palatini formalism or more precisely the metric-affine formalism and studied many times in the litera- 
ture, see for example (30l - [33| . In the differential geometry, the connections are used to define the parallel transports of 
the tensor fields in the manifold and have independent geometric meanings. So the metric-affine formalism is usually 
thought to be more natural than the metric approach. If the matter fields minimally couple to the metric and do not 
couple to the connections directly, and the action of gravity is the Einstein-Hilbert type, the metric-affine formalism 
lead to the same equations with GR. However, in the following cases, two approaches will predict different dynamics: 
(i) modified gravity theories, for example the extensively studied f{R) theories [32|,[33j]; (ii) non-minimal couplings 
between the matter fields and the gravitational field [34| ; (iii) the matter fields couple to the connections directly, for 
example when the spinor fields are present, their Lagrangian densities should contain the connections explicitly [35| . 

The higher derivative scalar fields must also couple to the connections directly as a result of covariant theories. 
So using the metric-affine formalism, some differences from the results studied through the metric approach are 
expected. These differences may be taken as to account for the deviations of the spacetime from pseudo-Riemannian 
or as modifications to the scalar field equation. We will take the second viewpoint and derive the effective Lagrangian 
density of the scalar field from the metric-affine formalism, from which we can see that the modification terms are 
suppressed by the powers of the Planck mass and usually have small effects. However, in some cases where the original 
Lagrangian density itself contains the higher order terms these modification terms could be significant. 

This paper is organized as follows. In Section II, we simply review the metric-affine formalism; in Section III we 
study in detail what would be the differences produced by the metric-affine formalism in the higher derivative scalar 
field theories; in Section IV we illustrate the significance of the modifications brought by the metric-affine formalism 
in a simple model; Section V is the summary. 

II. THE METRIC-AFFINE FORMALISM 

As mentioned in the Introduction, with the metric variational principle, the spacetime is assumed to be pseudo- 
Riemannian throughout and fully specified by the symmetric metric tensor. The affine connections are chosen to 
be a priori the Levi-Civita connections, which are symmetric r M = r M and metric compatible V p g p<T — 0. The 
gravitational field equation is obtained by varying the action only with respect to the metric. In the standard GR, 
we adopt the Einstein-Hilbert action for the gravity, 

M 2 ( 

S 9 = ^J d ' x VgR , (i) 

where Mp = l/y/8ivG is the reduced Planck mass and g — —det\g p „\, R is the curvature scalar which is finally written 
as a function of the metric and its derivatives via the relations of the Levi-Civita connections to the metric. 

However, in the metric-affine approach, the connections are treated independently, they have no a priori relationship 
with the metric tensor. We will use symbols with hats to distinguish the variables and operators associated with the 
general connections from those with the Levi-Civita connections. First we use these general connections to define the 
covariant derivative 

V P T£ = d,F$ + t%T? - t%TS , (2) 

where is an arbitrary tensor field. The general connections are not limited to be symmetric, the antisymmetric 
part is called as the torsion tensor 

^ = i(f^-f%). (3) 

Furthermore, these connections may not be metric compatible, the covariant derivative of the metric tensor is by 
definition named as the non-metricity tensor, 

= V p g?° . (4) 
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The torsion and the non-metricity tensors are used to specify the deviations of the geometry of the spacetime from 
pseudo Riemannian. 

Besides the covariant derivative operators, the Riemann tensor is defined the same as that in the metric approach 
except replacing the Levi-Civita connections with the general ones, 

which has no dependence on the metric. Because T is not symmetric in the sub indices, the Riemann tensor has only 
one obvious symmetry: it is antisymmetric in the last two indices. As the same, the Ricci tensor is defined by the 
contraction of the first and the third indices in the Riemann tensor 

R^v = , (6) 

which only depends on the connections and is not necessarily symmetric. Further contraction of the residual indices 
through the symmetric metric tensor gives the curvature scalar 

R = g^R pv . (7) 

To get the gravitational field equation, we should vary the action with respect to both the metric and the connections. 
Naively, the total action is 

S = S g + S m . (8) 
We only consider the Einstcin-Hilbcrt action for the gravity throughout this paper, 

M 2 r 



Sg= 2j d x V99^R^) ■ (9) 
The matter fields (represented by tp) may couple to both the metric and the connections directly, 

S m = / d i x^/g£ m (^p 7 g^u, T) . (10) 



Instead of varying the action with respect to f directly, a better way is to do the variation with respect to the 
tensor which is expressed as the difference between f and the Levi-Civita connections [36j], 

= f % - r% . (ii) 

As we know, the connections are not tensors but their difference is. The C-tensor is also independent of the metric 
and its variation is equivalent to that of V. Similarly we can also separate the Ricci tensor R^ v constructed from the 
Christoffel symbols from the original Ricci tensor R^, 

Rfj,u = Rp.v + V v C a all — VffC^ + C y vp C p atl — C p pa C a Vfi , (12) 

where the covariant derivative operators without hats are associated with the Levi-Civita connections. With such a 
separation the Einstein-Hilbert action we start from becomes 

m 2 f ~ m 2 r 

S 9 = -f J d^Xy/ggTR^ = ^J d^lR + g» v {V v C% - + C° vp C^ - C% a C%)] 

M 2 f 

-> -f J d^[R + g^{C° vp C% - C pa C° vp )\ . (13) 

In the third step of the above calculations, we have neglected the terms containing the derivative of the C-tensor 
because one can find through the Stokes theorem that they only depend on the C-tensor and the metric over the 
boundary on which all the fields are fixed as implied in the variational principle. These surface terms have no effects 
on the dynamics. Please note that the Stokes theorem which works for the covariant derivative without hat does not 
work for a general covariant derivative. 

If the matter does not couple to the connections directly, the matter Lagrangian is independent of the C-tensor. 
The variation of the action with respect to the C-tensor lead to the following equation, 

- c-V£ + c p % + cv " ca ^9 pa = ■ ( 14 ) 
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This is a tensor equation with three independent indices. The contraction of p and p gives 

C a \ = C a aa + 3C aa a , (15) 

and taking the trace on p and a we get 

C*na = 3C"^ M + C pa a . (16) 
The contraction of a and p in Eq. (fT4l leads to the uninteresting identity = 0. From Eqs. (|T5|) and (jTHJ) , we have 

cv = C°\ = \c a \ , (17) 

with it, Eq. (|14p may be rewritten as 

C pffAI + C app = C aa a g pp + C pa a g pa . (18) 

Permutating the indices p, er, p give 

CW a + C pa» = C pa a g p(y + C aa a g pp , (19) 

+ C w<j = C pa a g ap + C pa a g ap . (20) 

Combining the Eqs. ([15]). (HI?)) and (|2T))) one obtains finally 

C* CTW = C pa a g pa . (21) 

This result means the C-tensor cannot be determined uniquely. So it deserves pointing out that even in the case 
that the action of the gravity is Einstein-Hilbert and the matter does not couple to the connections directly, the 
metric-affine approach does not have to produce the pseudo-Riemannian spacetime without further restrictions. This 
is because the gravity has an extra gauge symmetry, the Einstein-Hilbert action is invariant under the following 
projective transformation |30L l3lj 

C p pa ^ C p pa + 5^ p , R^R, (22) 

where £ p is an arbitrary vector field. That means given a solution C p pa we can always find other equivalent solutions 

C p prT = C p pa . + 5 p £, p . Nevertheless this uncertainty does not bring physical differences because in this case the projective 
symmetry is only a gauge symmetry. It does not affect the matter field equation and furthermore from Eqs. (|13l) . 
(1171) and (1211) that it has no contribution to the energy-momentum tensor which sources the gravitational field. If 
we want to get a unique solution we should fix the gauge. This can be done by imposing the restrictions on the 
connections by hand, for example, if we require the spacetime is torsion free S p pa = or the connections are metric 
compatible Q pa — 0, one can show straight forwardly that the C-tensor must vanish and the connections must be 
the Levi-Civita connections. Alternatively we can add some additional terms containing Lagrange multipliers times 
the connections in the action, the variation of the action with respect to the Lagrange multipliers will automatically 
give the constraint equations on the connections. We will discuss this way in more detail in the next section. 



III. HIGHER DERIVATIVE SCALAR FIELD THEORIES WITH METRIC-AFFINE VARIATIONAL 

PRINCIPLE 

The scalar field with covariant higher derivatives must couple to the connections directly. In this paper, for 
simplicity, we only consider the scalar field theories in which the Lagrangian density contains the derivatives up to 
second order. That is 

£ = £ (0, V^, V M V„<£) . (23) 

In the metric approach, all the derivatives are defined through the Levi-Civita connections, so that we can write the 
Lagrangian density as 

£ = <?V, (f>un) , (24) 

where we have defined 



(25) 
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In the metric-affine formalism, the Lagrangian density depends on the connections F through their associations with 
the covariant second order derivatives. With the same strategy used in the previous section, we separate the derivatives 
associated with the Levi-Civita connections from those with hats, 

Vp> - fa , V M V„0 = fa„ - C^fa • (26) 

This means the scalar field is equivalently couple to the C-tensor directly. If the Nth. derivative terms with N > 2 
are considered, it is easy to show that the derivatives of the C-tensor will be also present in the Lagrangian density. 
Including the gravity, the total action we start from is 



-f- J <TxJgR+ j trxy/g£i(4>, fa, V p V„0) + S ext , (27) 

where S ex t is the action of other matter which are assumed to be coupled to the gravity only through the metric. 
This action is equivalent to 

m 2 r r m 2 

S = ~2 J d * X ^ R + J dix ^^f^ v {C a vp C^ - C% a C a vp ) + CM, fa, fa» - C^fa)] + S ext , (28) 

now the second integral at the right hand side of the above equation may be considered as an effective action of the 
scalar field within the framework of GR. The whole action does not contain the derivatives of the C-tensor, so it 
represents some constraints rather than dynamical degrees of freedom. The corresponding constraint equations are 
obtained by varying the action with respect to the C-tensor. Because in the action the C-tensor appears at least 
quadratically (different from the Lagrange multiplier which also represents a constraint), its equations actually express 
the algebraic relations of the C-tensor to the scalar field and its derivatives. Through solving these equations one can 
find the C-tensor expressing as a function of the scalar field 4> an d its derivatives. Then replace Ci CT in the action 
(|28[1 with the function found out we may get the the effective Lagrangian of the scalar field, 

M 2 

= -yV^pC^ - CV<?%J + C^(<f>, fa, fa M - C^fa) , (29) 

where the fact that the C-tensor considered as a function of </>, </> M and fa^ is implied. Now we can see clearly 
the difference between the metric approach and the metric-affine approach when the higher derivative terms of the 
scalar field are involved. In the metric approach, the scalar field exists in the pseudo-Ricmannian spacetime and its 
dynamics is described by the Lagrangian density (|24p . From the metric-affine approach, however, if non- vanishing 
C-tensor is found we can think that the spacetime is still pseudo-Ricmannian but the Lagrangian density of the 
scalar field gets some modifications or "corrections" . Of course the non- vanishing C-tensor may be also geometrically 
interpreted as describing the deviations of the spacetime from pseudo-Ricmannian. In this paper we will adopt the 
first interpretation because it is more convenient for us to compare the different dynamics produced by these two 
approaches. 

Nevertheless before going ahead, there is a subtlety should be considered for the metric-affine formalism. The 
variation of the action (|2"7|) or (|28[1 with respect to the C-tensor gives the equation 



C™J? + C"\ + CV - C"V<r = W ^fa , (30) 



where 



w pa ^ _J^> _ (31) 

d(V p V CT 0) 

The left hand side of Eq. (|30j) when contracting a and fi vanishes identically as we have mentioned in the previous 
section, this requires w ptJj V p 4> = 0. However this requirement put a very strong constraint on the scalar field and 
in many cases it only leads to some trivial solutions which are not interesting for the dark energy or early universe 
dynamics. In general case w^cj)^ does not vanish and this makes the whole theory inconsistent. The reason of this 
inconsistency is that the Einstein-Hilbert action is invariant under the projective transformation (|22[) but the general 
action describing the scalar-connection coupling is not. To avoid this inconsistency, we should break the projective 
symmetry in the gravity sector. One choice is to extend the Lagrangian density of the gravity to include higher 
order terms like R^R^ and so forth, but this will introduce extra degrees of freedom. Another choice is imposing a 
constraint on the connections by adding in the total action a term Sl which containing some Lagrange multipliers. 
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This is similar to a gauge fixing in the gauge field theory. The number of degrees of freedom we need to fix is four, 
i.e., the number of the components of the four- vector used for the projective transformation (1221) . 

We will follow Refs. [3l| and [33| to consider two kinds of Lagrange multipliers which put single vector constraints 
on the connections. The first kind of the Lagrange multiplier is 

Sli = J ^XyfgA^S^ , (32) 

where = (l/2)(C p p/J — C p ) is the torsion vector and is the Lagrange multiplier. With this additional term 
Eq. (f3"U)) is modified as 

- CVS + C "\ + C %" ~ C V = ^K^m + \^ A " - \^ A<T ) ■ (33) 
The variation with respect to A 1 - 1 gives the constraint 

S, = , i.e., C% = C% . (34) 
These equations are consistent. The contraction of a and /i in Eq. (|3"3")l gives 

A p = -\w pa <t>a ■ (35) 
With the same procedure from Eq. (fT4f to Eq. (|2"Tj) . we can derive that 

Mp I 

+ -{-w pa <f) a + 3w ap <j) a - Zw a a cf) p )g' jp + <t> p w pa + cj ) p w' jp - ^w^P] . (36) 
6 

From the above equation we can solve for the C-tensor as a function of (j> and its derivatives, then substitute this 
function in Eq. (|2^|) . we will get the desired effective Lagrangian for the higher derivative scalar field. We see from Eq. 
(|36|) that the C-tensor is suppressed at least by the square of the Planck mass, so in the resulting effective Lagrangian 
density (f2T))) the modification terms are also suppressed at least by Mp. 

The second kind of the Lagrange multiplier we consider in this paper is described by 

S L 2 = J d^x^B^Q^ , (37) 

where Q M = (l/4)Q Mp p = (l/2)C p pp is the Weyl vector and B p is the Lagrange multiplier. With this, the equation 
for the C-tensor becomes 



2 , „„ I 

and the constraint put by the Lagrange multiplier is 



C™J P + C p \ + C\ P ~ CV<T = ^K^m " ^B p ) , (38) 



Qp = \C\ P = ■ (39) 



Similarly we get the result 



C° pp = * [i(-«,«"^ a _ 2w a ^ a + 2w\^)g pp + hiw^K - 2w a ^ a - 2w a a ^)g pa 

Nip 4 o 

+ j(~w pa (j) a + 2w ap (j) a - 2w a a cj) p )g af " + (f) p w pa + (j) p w ap - . (40) 

Comparing with Eq. (|36[) . this result only have the differences in some numerical factors. 
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IV. A SIMPLE MODEL 



Now we use a simple model to illustrate whether the differences caused by the metric-afhne formalism are significant. 
The model has a Lagrangian density degenerate in the higher derivative terms, 

£ = /(</>, A)O0 , (41) 

where X = ^<j> fl <j> 11 , / does not depend on the second derivative of the scalar field and the d'Alembert operator is 
defined as D e g^V^V,,. For the general connections we should be careful about the order of the metric and the 
derivative operators. We can find that w ( " T is symmetric 

w» a = fgP° . (42) 

With the first kind of Lagrange multiplier, the result (|36p is simply 

c ° w = ~Mr I P {(t>P - r + ^ gPa) ■ (43) 

In geometric language, because C afJ,p is symmetric about fi and p, the torsion tensor vanishes. But the non-metricity 
tensor is non-zero, 



2/ 

The effective Lagrangian density what we need is 



qt = -g^r iw" + W" + 2 ^ pa ) • ( 44 ) 



4/ 



2 



4 = /°* +3^*- (45) 
So, we can see from the above equation that comparing with the metric approach the metric-affine formalism brings 

4 f 2 

the extra term -^prX to the Lagrangian density. This modification term is suppressed by the square of the Planck 

mass and in many cases cause minor differences in the dynamics. However, if the original Lagrangian itself relies on 
the higher dimensional operators, this modification may have important effects. For example, consider the model in 
which 

where c is a dimensionless constant. This model has been studied with the metric approach in Refs. [I?], [Hi f° r 
addressing the issue of dark energy and in Ref. pll | for bouncing universe. With the metric approach this model is 
symmetric under the field shift <f> —></> + C with constant C. Correspondingly the conserved current is 

J" = -^3-(V M x-^n<£)+<^ . (47) 

In the spatially-flat Friedmann-Robertson- Walker universe, the conservation law demands that the "charge density" 
J° = 6 — ■yts-HS 2 scales as a -3 , where a is the scale factor of the universe, H is the Hubble parameter and the dot 

represents the derivative with respect to time. This makes the analysis of the dynamics gets large simplification. Here 



we only review the main steps, for more details see Ref. 17 1. In the expanding universe J° dilutes quickly and the 



equation of state of the scalar field is asymptotically w ~ — 1 = — (2 + Wb), where Wb is the equation of state of 
the component dominating the universe. So in the matter dominated era w = —2 and if the universe is dominated by 
the scalar field itself w = — 1 . As the dark energy this model can account for how the universe shifts from the matter 
domination to the phase of the accelerating expansion. This dark energy model has a feature that the universe will 
end up in the de Sitter space. 

However if we start from the same Lagrangian density but adopt the metric-affine formalism, the model (|46[) is not 
shift symmetric. We can see that the effective Lagrangian density is 

C '* =L * + m x ^ 2 + m x2 ^ + m x) ^ ( l + w P w + m x * 2 + + m x) • (48) 
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Among the three modification terms, the last two terms are suppressed by much higher order powers of the Planck 
mass and are expected to have negligible effects. But the first correction term — -p- A^> 2 has lower order than the higher 

dimensional term -^-XOcj) originally exists in the Lagrangian we start from. In fact this term can be considered as an 

effective mass for the scalar field. Its effect on dark energy has been investigated in Ref. [I?} • With this effective mass 
the whole picture of the future universe is changed. The universe will not end up in the de Sitter space, the equation 
of state of the scalar field will increase from —1 to 1, then its energy density will decrease very fast, faster than the 
energy density of the dark matter, and the universe will return to the phase of matter domination. It deserves pointing 
out that such like effective mass has also been introduced in inflation model [13, [H| to suppress the amplitudes of the 
primordial perturbations. 

V. SUMMARY 

In this paper we studied the higher derivative scalar fields which received much attention in addressing the issues 
of the early and late universe with the metric-affine formalism. As a covariant theory, the higher derivative scalar 
fields have to couple to the connections directly and as a consequence within the metric-affine formalism they have 
dynamics different from those within the metric formalism. We showed by the method of effective Lagrangian that 
usually these differences are suppressed by the powers of the Planck mass and are expected to be small. However, 
if the Lagrangian density we start from itself relies on the higher dimensional operators the modifications brought 
by the metric-affine formalism might introduce important deviations. We used a degenerate higher derivative dark 
energy model to illustrate that these two approaches lead to significantly different pictures of the future universe. In 
other applications of the higher derivative scalar fields, for examples the inflation models, the non-singular bouncing 
universe, and the covariantization of the Galileons, we believe the metric-affine formalism as an alternative to the 
metric formalism deserves more attention. 
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